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Abstract. Let s q (n) denote the sum of the digits in the q-ary 
expansion of an integer n. In 1978, Stolarsky showed that 

limmf £^!) = o. 

n-s-oc S2\Tl) 

He conjectured that, as for n 2 , this limit infimum should be 
for higher powers of n. We prove and generalize this conjecture 
showing that for any polynomial p(x) — a,hX h + a^-ix' 1 ^ 1 + • • • + 
a G 7L\x\ with h > 2 and a/j > and any base q, 

l iminf £iM^)) = o. 

n-Kx> s q (n) 

For any e > we give a bound on the minimal n such that the 
ratio Sq{jp{n))/s q {n) < e. Further, we give lower bounds for the 
number of n < N such that s q (p{n)) / s q (n) < e. 



1. Introduction 

Let q > 2 and denote by s q (n) the sum of digits in the g-ary repre- 
sentation of an integer n. In recent years, much effort has been made 
to get a better understanding of the distribution properties of s q re- 
garding certain subsequences of the positive integers. We mention the 
ground-breaking work by C. Mauduit and J. Rivat on the distribution 
of s q of primes [9] and of squares [10]. In the case of general polynomi- 
als p(n) of degree h > 2 very little is known. For the current state of 
knowledge, we refer to the work of C. Dartyge and G. Tenenbaum [3], 
who provided some density estimates for the evaluation of s q (p(n)) in 
arithmetic progressions. The authors [7] recently examined the special 
case when s q (p(n)) ~ s q (n). 
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A problem of a more elementary (though, non-trivial) nature is to 
study extremal properties of s q (p(n)). Here we will always assume that 

(1) p(x) = a h x h + a h _ x x h ~ x H V a E Z[x] 

is a polynomial of degree h > 2 with leading coefficient > 0. 
In the binary case when q = 2, B. Lindstrom [8] showed that 

/on r s 2 (p(n)) 

(2) hmsup — ■ = /i. 

n-^oo log 2 71 

In the proof of (j2J), Lindstrom uses a sequence of integers n with many 
l's in their binary expansions such that p(n) also has many l's. The 
special case p(n) = n 2 of (|2J) has been reproved by M. Drmota and 
J. Rivat [5] with constructions due to J. Cassaigne and G. Baron. 

On the other hand, it is an intriguing question whether it is possi- 
ble to generate infinitely many integers n such that p(n) has few l's 
compared to n. If this is possible, then this is indeed a rare event. It 
is well-known [UH2] that the average order of magnitude of s q (n) and 
s q (n h ) is 

In particular, the average value of s q (n h ) is h times larger than the 
average value of s q (n). 

In 1978, K. Stolarsky [TJ] proved several results on the extremal 
values of s q (p(ri)) / s q (n) for the special case when q = 2 and p(n) = n h . 
He showed that the maximal order of magnitude is 

c(/ i )(log 2 n) 1 - 1 /'\ 

where c(h) only depends on h. This result is best possible, which follows 
from the Bose-Chowla theorem [21 [6] . His proof can be generalized to 
base q and to general polynomials p(n). Although this generalization is 
straightforward, we include it here for completeness. Recall that p(n) 
may have negative coefficients as well. 

Theorem 1.1. Let p(x) G Z[zc] have degree at least 2 and positive 
leading coefficient. 

(1) If p(n) has only nonnegative coefficients then there exists C\, 
dependent only on p(x) and q, such that for all n > 2, 



s q {n) 



< ci(log n) 
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This is best possible in that there is a constant c' x , dependent 
only onp(x), such that 

s q {n) 

infinitely often. 

(2) If p(n) has at least one negative coefficient then there exists c 2 
and n , dependent only on p(x) and q, such that for all n > n , 

s q (p(n)) 



s q [n) 



< c 2 log. n. 



This is best possible in that for all e > we have 
s q {p{n)) 



s q (n) 

infinitely often. 



> (q - 1 - e) log n 



The proof of this result along with some useful preliminary results 
are given in Section [2j 

For the minimal order of s q {p{n))/ s q (n), Stolarsky treated the special 
case q = 2 and p(n) = n 2 . He proved that there are infinitely many 
integers n such that 

s 2 (n 2 ) < 4 (log log n) 2 
s 2 (n) — logn 

He conjectured that an analogous result is true for every fixed h > 2 
but he did "not see how to prove this" . 

Conjecture 1.2 (Stolarsky [H], 1978). For fixed h>2, 

liminf ^ = 0. 

rwoo S2{n) 

By naive methods, it can be quite hard to find even a single value 
n such that S2{n h ) < s 2 (^) for some h, let alone observe that the 
limit infimum goes to 0. For example, an extremely brute force cal- 
culation shows that the minimal n such that s 2 («. 3 ) < s 2 (n) is n = 
407182835067 w 2 39 . 

In Section [3] we prove and generalize Conjecture 11.21 

Theorem 1.3. We have 

lim inf S ^=Q. 

n->oo S q (n) 

In view of our generalization, it is natural to ask how quickly we can 
expect this ratio to go to zero. Recall that h = degp. 
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Theorem 1.4. There exist explicitly computable constants B and C , 
dependent only onp(x) and q, such that for alls with < e < h(4h + l) 
there exists an n < B ■ C 1 ^ with 

s q (p(n)) 



(n) 



< e. 



The proof of this result along with an explicit construction for B and 
C is given in Section |H As a nice Corollary to this result we have 

Corollary 1.5. There exists a constant Co, dependent only on p(x) 
and q, such that there exists infinitely many n with 



< 



Sg(n) logn 
This is an improvement and generalization upon (jj]). 

Proof. By solving for e in n < B ■ C 1 ^, one easily sees that e < 
log log Without loss of generality we may assume that B > 1, 
hence we can take Co = log C. □ 

One might expect that the ratio s q (p(n)) / s q {n) is small only rarely, 
with most of its time being spent near h = degp. It turns out that this 
ratio is small somewhat more often than expected. 

Theorem 1.6. For any e > there exists an explicitly computable 
a > 0, dependent only on e, p(x) and q, such that 

#(n<iV: S -^<e\>>N" 



where the implied constant also only depends on e, p(x) and q. 
The proof of this result is given in Section [5j 

In Section [6] we collect together questions raised in this paper and 
pose some further lines of inquiry for this research. 

2. Preliminaries and Proof of Theorem 11.11 

First we prove some preliminary results about s q which we need in 
the proofs. Recall (cf. [S]) that terms are said to be noninterfering if 
we can use the following splitting formulae: 

Proposition 2.1. For 1 < b < q k and a, k > 1, 

(5) s q {aq k + b) = s q (a) + s q (b), 

(6) s q {aq k -b) = s q {a - 1) + (q - l)k - s q (b - 1). 
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Proof. Relation (J5J) is a consequence of the (strong) g-additivity of s q . 
For 06]) we write b — 1 = Yli=o ^iO 1 with < 6j < q — 1. Then 

s 9 (ag fc - 6) = s,((a - l)q k + q k - b) = s q (a - 1) + s 9 (g fc - b) 

'ifc-i 



s g (a - 1) + s J ^(o - 1 - b^ 

\i=0 

fe-i 

l) + J>-l-6i) 



i=0 



implying (jBj). □ 

Proposition 2.2. The function s q is subadditive and submultiplicative, 
i.e., for all a,b £ N we have 

(7) s q (a + b) < s q (a) + s q (b), 

(8) s q (ab) < s q (a)s q (b). 

Proof. The proof follows on the lines of [121 Section 2]. As for (J7|), an 
even stronger result is true, namely that s q (a + b) = s q (a) + s q (b) — (q — 
1) • r where r is the number of "carry" operations needed when adding 
a and b. Writing b = Yli=o hq 1 we also have 

(fe-i \ fc-i 
1=0 / 8=0 

fc-i fc-i 

= z2 Sq ( ® + ~ " + a ) ^ s ( a ) ^2 

i=0 bi times i=0 
where we used twice the subadditivity of s q and we get (jSJ). □ 

Proof of Theorem This is an almost direct generalization of Sto- 
larsky's proof (see [HI Section 2]) and Propositions 12.11 and 12.21 First, 
suppose that p(n) has only nonnegative coefficients. Then using Propo- 
sition [272] we see that s q (p(n)) < p(s q (n)). Therefore 

s q jp(n)) < min{(g - 1) (log,p(w) + l) ,p(s q {n))} 

s q (n) ~ s q (n) 

min{log n,s q (n) h } 
9 < ci ^ 
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where c\ only depends on p(x) and q. If \og q n < s q {n) h then we have 
(log f? n) 1 / /l < s q (n). From this and (jHj), we get that 

s q (n) " Cl (tog,n)V* c ^ n > ■ 
Alternately, if log„n > s q (n) h then we have (log„ n) 1 ^ > s q (n) and 

< Cl . Sq (n) h -i < Cl (log q n?-V h 



For the lower bound, set 

(10) k = Llo gf ,(A(/i + l)!)J + l, 

where A = max{aj : < i < h}. By Stolarsky's use of the Bose-Chowla 
Theorem, there are infinitely many integers M > 3(k + 1) such that 
there are integers yi, y2, ■ ■ ■ , Vn with N := \_{M + l)/(k+ 1)J — 1, with 
the following three properties: 

(i) 1 < Vi < V2 < ■ ■ ■ < Vn < M h , 

(ii) yi = mod {k + 1), 

(iii) all sums yj 1 H — ' + Vj h are distinct {distinct sum property); here 
jii 32, ■ ■ ■ iju £ {1j 2, . . . , iV} with possible repetition. 

Note that (iii) implies the distinct sum property for all y^ + ■ ■ ■ + yj t 
with 1 < % < h. Now set 

N 

n = 

i=i 

such that 

h h 

(11) P (n) = oi-n* = J2Y1 h *> ■ ■ ■ ' M<f i/n+ - +yjvhAr 

where the summation ^2 ' is over all vectors (hi, . . . ,hj^) satisfying hi + 
• - • + hisr — i, and a(i; hi, ... , h^) denote the multinomial coefficients 
i\/(h\\ . . . h^-) bounded by i\. Consider (fTTj) as a polynomial in q. By 
the distinct sum property (iii) we have for all < i < h that 



#{yihi H h VNh N ■ h-\ h h N = i} 



N - 1 



Thus the coefficients of q y^+-+VNh N = q R w j tn ^ + ■ • • + h N = h 
in (fii| are nonzero and bounded by 

(12) a h h\ + a h _i(h - 1)! + • • • + a < X{h + l)h\ < q k . 
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By ( |T2|) and (ii), the sums y±hi + • • • + yNh-N = mod {k + 1) and 
hence the powers q R are noninterfering and we get 

s q (p(ri)) (N + h-l\ 1 N* 1 " 1 

s q {n) - v w-i y ' iV ~ w - ' 

By construction, 

hg q n <y N + l< 2 h+1 N h (k + l) h . 

The claim now follows by observing that k is largest for q = 2. 

Secondly suppose that p(n) has at least one negative coefficient. 
Then the first claim follows by observing that s q (p(n)) < |log„p(n)J +1 
for sufficiently large n. For the lower bound, denote by a,j the negative 
coefficient with smallest index j, i.e., aj < and Oj_/ > for 1 < I < j. 
Then for all sufficiently large k we have 

sM<l k )) = s q {a h q hk + ■■■ + a j+1 q ij+1)k + a 3 q jk + a^g^ -1 ^ + ■ ■ • + a ) 

3-1 

= s g (a fc g (/l - i)fc + • • • + a i+1 g fc + a^) + ^ s g (aj) 

> jfe(g - 1) - s(-a 3 - 1) 

> fc(g - 1 - e). 

Here we have used Proposition 12.11 As s q (q k ) = 1 and \og q (q k ) = k, 
the result follows. This completes the proof of Theorem 11.11 □ 

3. Proof of Theorem 11.31 

The proof of Theorem 11.31 will use a construction of a sequence with 
noninterfering terms. First assume that p(x) = x h , h > 2 and define 
the polynomial 

t m {x) = mx 4 + mx 3 — x 2 + mx + m 

where m G Z with m > 3. By consecutively employing (jSJ) and (jH]) we 
see that for all k with g fc > m, 

(13) s g (t m (g fe )) = (q - l)k + s q (m - 1) + 3s q (m). 

The appearance of k in ( TT3|) is crucial. The next lemma lies at the 
heart of the proofs. We will use it to see that s q (t m ( y q k ) h ), h > 2, is 
independent of k whenever k is sufficiently large. Furthermore, we will 
exploit the fact that the coefficients of [x l ] in t m (x) h are polynomials 
in m with alternating signs. 
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Lemma 3.1. For fixed h>2 and m > 3, we have 

t m (x) h = ^2ci,h( m ) xi 

i=0 

satisfying 

(14) < c ith {m) < (2mh) h % = 0, 1, . . . , Ah. 
In fact, we have 

(15) co,h{m) = c 4 h,h( m ) = "A c i,h{m) = c 4 h-i,h( m ) = hm h . 

Proof. A direct calculation shows that t m (x) 2 and t m (x) 3 have property 
(114"]) provided m > 3. Set h = 2h\ + 3/i2 with max(/ii, h?) > 1. Then 



tmi^C) t m {x) . . . t m {x^j • t m (x) . . . t m {x} . 



h\ times h 2 times 

Since products of polynomials with all positive coefficients have all 
positive coefficients too, we get c^hijn) > for all i = 0, 1, ... , Ah. On 
the other hand, the coefficients of t m (x) h are clearly bounded by the 
corresponding coefficients of the polynomial 



m », 1+I+I > +I . +I v =m » £ (*)(;.) (f) (f) 

Therefore, for all i with < % < Ah, we have 

h\ 

(16) Ci.fcH < m h 



i) / /, , ^(/i-OKi-jW-W-O! 

< m h W exp(/i — i + i— j+j — k + k — I) 

< m h h\e h < {2mh) h . 

□ 

Proof of Theorem[LE Now let fc be such that g fc > (2mh) h . By CHI) 
and (jSJ) we then have 

s q (tm(q k ) h ) = Sq(cQ >h (m)) + s q (c 1)h (m)) H h s ? (c 4 ? l ,/ ! ,(m)) 

where s g (c{ ( fe(m)) is bounded by a function which only depends on q, 
m and h. Together with f[T3"j) and letting fc — > oo we thus conclude for 
fixed m > 3, 



lim s g (t m (gT)A ? (*m(9 )) = 0, 

fe— >oo 



as wanted. 
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Finally we consider the case with a general polynomial instead of x h . 
Write 



(17) p(t m (x)) = a h t m (x) + a h ^it m {x) H h a x t m {x) + a 

where > and h > 2. First suppose that all the coefficients are 
nonnegative. Lemma 13.11 shows that for i with 2 < i < h all the 
coefficients of t m (x) 1 are positive. Also, the coefficient [x 2 ] in p(t m (x)) 
is nonnegative if we choose m > 3 sufficiently large. In fact, a sufficient 
condition is ({^)Tn h ~ hm 11 ^ 1 ) > a\ which is true whenever 



m > 



2ai 



h(3h — 5)dh 



i/(h-i) 



If the polynomial p(x) has negative coefficients then there is a positive 
integer b such that the polynomial p(x + b) has all positive coefficients. 
A good choice for b is 







" A " 


[i + Al 


= 1 + 




ah 




ah 



(19) 



This is easy to see since both p(x 
and 



A 



b) 



,a h {x 



max{|aj| : < i < h}. 



h-l 



a h (x 



+ b) h -\Y,(x + by 



i=0 



a h 



x + b-1 



{x + b) h + 



A 



x + b 



x + b — 



- ((a h x + (b- l)a h -\){x + b) h + A) 



have nonnegative coefficients when b > 1 + — . Thus if q k > m + b then 
s q (t m (q k ) + b) = (q — l)k + s q {m — 1) + 2s q {m) + s q {m + b) and one 
similarly obtains for fixed m, 

lim s q {p{t m {q k ) + b))/s q {t m {q k ) + b)=0. 



This completes the proof of Theorem 11.31 



□ 



4. Proof of Theorem 11.41 

The construction of an extremal sequence in the proof of Theorem II .31 
gives a rough bound on the minimal n such that s q (n h ) < s q (n). We 
first illustrate the method in the case q = 2, h = 3. 
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Set m = 3. Then for all k with 2 k > max q hi™) = 225 we have 

0<«<4/i ' 

s 2 (t 3 (2 fc )) = k + l + 6 = k + 7, 

s 2 (t 3 (2 k ) 3 ) = 2- (4 + 3 + 4 + 4 + 4 + 4) + 4 = 50. 

Therefore, by setting k = 44, we get 

min{n : s 2 (n 3 ) < s 2 {n)} < 2 178 . 

It is possible to show that the minimal such n to be n = 407182835067 
2 39 . 



Proof of Theorem \l-4\ Consider the general polynomial 

p(x) = a h x h + a h _\X h ~ x + • • • + a € l\x\ 

with ah > 0, h > 2. Let A = max |aj|. Pick 6 such that p(x + b) has only 
nonnegative coefficients, as in ( 1191) . Pick m > 3 such that p(t m (x) + b) 
has only nonnegative coefficients, as in ( ITBl) . Our task is to bound the 
coefficients of of p(t m (x) + b) G Z[x}. 

To begin with, we estimate the coefficient of x l , < i < h of p(x + 6), 



(20) 



a i V 



j=i 



< 



E 



< X(2b) h . 



Combining (|20|) with ( IT5|) . we find that the constant term of p(t m (x)+b) 
is bounded by 

h 



h+l 



1 



i=0 



m — 1 



< Xh(4mbh) h 



since m > 3 and h > 2. Again from (|2"U|) and f[T4l) . we find that the 
other coefficients of p(t m (x) + b) are bounded by 



(21) 



\(2b) h J2( 2mi Y < Xh(4mbh) h . 



i=l 



Therefore the coefficients of p(t m (x) + b) are bounded by \h(4mbh) h . 
Hence for q k > m + b, we have 

'log(A/i(4m&/i) /l ) 



(22) sMtUq )) +b)<(q- l)(4/i + 1) 



logg 



On the other hand, we clearly have s q {t m (q k ) + b) > (q — l)k for 



q k > m + b. Let 



k 



4/i + l /log(A/i(4m6/i) A ) 
logg 



+ 1 



+ 1. 
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Then for < e < h(Ah + 1) we have q k > m + b and hence 

s q {p{t m {q k ) + 6)) 



s q {t m (q k ) + b) 



< e. 



Therefore, 



minln: S -M^l <e\<t m {q k ) + b 



< m(q Ak + q 3k + q k + 1) 

< 2mq 4k 

<2mq 4 (q\h(4mbh) h f 6h+4)/£ . 

Setting B := 2mq A and C := (q\h{4mbh) h ) + , it gives the desired 
result. □ 

5. Proof of Theorem 11.61 

We start our analysis with the simple case of p(n) = n h . Let t m (x) = 
mx 4 + mi 3 — x 2 + rax + m as in Section [3] Letting n = = t m (q k ) 
we see from equation f[T5j) that, for m < q k , 

s q (n) = (q — l)k + s q (m - 1) + 3s g (m) > (q — l)k. 

If m has i g-ary digits then n will have 4k + i g-ary digits. We see that 
tm(<l k ) h is of length at most h(4k + i). 
Let t m (q k ) h = Zjto c i<l kj - These 

Cj are dependent upon m and h, but 
are independent of k for k sufficiently large. We see from equation (TL6l) 
that Cj < (mh ■ 2) h and hence has at most hi + h log q h + h g-ary digits. 
As there are (4h+l) coefficients Cj and s q (cj) < (q — 1) (hi+h \og q h+h), 
we get 

s q (n h ) < (q-l)(4h + l) (hi + h\og q h + h) . 
Combining these together we have 

s q (n h ) (q-l)(4h + l) (hi + h\og q h + ti) 
s q (n) ~ (q~l)k 

(4h+ 1) (hi + h\og q h + h) 
= k ' 

Without loss of generality suppose that < e < h(4h + 1). Let ko be 
large enough so that k > i and 

(Ah + 1) (hi + h \og a h + h) 
— < e. 
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For i sufficiently large, we can take k = ( ih + 1 )( hl + l ) _ Then this says 

that for every sufficiently large m having i q— ary digits, there is an 
integer n having 4/c + i q— ary digits such that 

s q {n) 

Moreover, by construction, each distinct m will give rise to a distinct 
n. Letting 

% % f 

a = — : > 



4k + i ~ 4(4/i + l)(h + l)i/e + i 4(4/i + l)(h + 1) + e 
we get as N — > oo that 

#L<N: < e \ » 

Now to extend this for general p(x), we proceed as we did in the 
proof of Theorem 11.31 First consider the case where p(x) has only non- 
negative coefficients. There is a lower bound on m such that p(t m (x)) 
will have only nonnegative coefficients and we proceed as before, after 
which the result follows as before. Second, if p(x) has at least one 
negative coefficient, then consider instead the polynomial p(x + b) for 
sufficiently large b, which will have only nonnegative coefficients, and 
the result follows. 



6. Conclusions and further work 

All results in this paper have explicitly computable constants for 
existence or density results. Many times these constants are far from 
the observed experimental values, and it is quite likely that many of 
them may be strengthened. Examples include Theorems 11.41 and 11.61 

Some obvious generalizations of this problem are in looking at the 
ratios of Sq( - Pl( - n \\ or even more generally of — with respect to two 

s«(pa(n))' ta J s q2 { P2 {n)) r 

different bases qi,q2- Alternately, instead of looking at polynomials 
p(x) G Z[x], we could look at quasi-polynomials Lp( ra )J with p(x) G 
R[x}. 

As another direction, we could consider expansions in other numera- 
tion systems, e.g. the Zeckendorf expansion (or expansions with respect 
to linear recurrences) or the balanced based q representation. In the lat- 
ter case, for example, 11 = 1 • 3 2 + 1 • 3 1 — 1 -3°, and 4(11) = 1 + 1-1 = 1, 
being the sum-of-digits function in this representation. This value will 
quite often be 0, but its extremal distribution could still have some 
interesting properties. 
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